On European Portuguese Automatic Syllabification
Abstract
This paper presents three methods for dividing European Portuguese (EP) words into syllables: two take graphemes as input, and the third processes phone sequences. All three incorporate linguistic knowledge about EP syllable structure, though to different degrees. In the experiments, the best method correctly recognized more than 99.5% of syllable boundaries, with comparable word accuracy. The much simpler method based on finite state transducers also performed well, making it suitable for applications where speed and memory footprint matter more. Since syllabification is an essential component of many speech and language processing systems, the proposed methods can be useful to researchers working with EP.
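The two figures quoted in the abstract, syllable boundary accuracy and word accuracy, can be illustrated with a minimal sketch. The abstract does not give the exact definitions, so this assumes that boundary accuracy is the fraction of reference boundaries the system recovers and that word accuracy is the fraction of words whose boundaries all match the reference. The function names, the `.` separator, and the three sample words are hypothetical and are not taken from the paper.

```python
def boundary_positions(syllabified, sep="."):
    """Return the character offsets (in the unsegmented word) where boundaries fall."""
    positions = set()
    offset = 0
    for ch in syllabified:
        if ch == sep:
            positions.add(offset)
        else:
            offset += 1
    return positions


def evaluate(gold, predicted, sep="."):
    """Return (boundary_accuracy, word_accuracy) under the assumed definitions.

    boundary_accuracy: recovered reference boundaries / all reference boundaries.
    word_accuracy: words whose predicted boundary set equals the reference set.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    total_boundaries = 0
    correct_boundaries = 0
    correct_words = 0
    for g, p in zip(gold, predicted):
        if g.replace(sep, "") != p.replace(sep, ""):
            raise ValueError(f"word mismatch: {g!r} vs {p!r}")
        g_bounds = boundary_positions(g, sep)
        p_bounds = boundary_positions(p, sep)
        total_boundaries += len(g_bounds)
        correct_boundaries += len(g_bounds & p_bounds)
        if g_bounds == p_bounds:
            correct_words += 1
    boundary_acc = correct_boundaries / total_boundaries if total_boundaries else 1.0
    word_acc = correct_words / len(gold) if gold else 1.0
    return boundary_acc, word_acc


# Hypothetical sample: the third prediction misplaces its only boundary.
gold = ["ca.sa", "por.tu.guês", "li.vro"]
predicted = ["ca.sa", "por.tu.guês", "liv.ro"]
boundary_acc, word_acc = evaluate(gold, predicted)
print(f"boundary accuracy: {boundary_acc:.2%}, word accuracy: {word_acc:.2%}")
```

Under these assumed definitions a single misplaced boundary costs a whole word, so word accuracy is usually the stricter of the two measures.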
Similar resources
Automatic Syllabification for Danish Text-to-Speech Systems
This paper describes a rule-based automatic syllabifier for Danish that uses the Maximal Onset Principle. Prior success rates of rule-based methods applied to Portuguese and Catalan syllabification modules were the basis of this work. The system was implemented and tested using a very small set of rules. The results gave word accuracy rates of 96.9% and 98.7%, contrary to our initi...
A Rule-based Syllabification Algorithm with Stress Determination for Brazilian Portuguese Natural Language Processing
This paper presents some improvements on an existing set of linguistic rules that is capable of performing the syllabification of Brazilian Portuguese words. An algorithm based on this set was also implemented. The improvements include new rules that depend on the stressed vowel to achieve the standard syllabification of some words that otherwise would be very difficu...
The Project HERON
In the project HERON, we developed a framework for articulatory speech synthesis for European Portuguese. The system combines several modules, developed or adapted in the project. The Linguistic Processing model uses new syllabification and grapheme-to-phone modules developed in the project. The construction of the gestural scores is performed using an adapted version of TADA (TAsk Dynamic Appl...
Automatic Syllabification Rules for Bodo Language
Syllabification is the task of identifying syllables in a word or in a sentence. Most syllabification tasks are done manually. Because syllabification rules vary from language to language, it is difficult to design a common set of syllabification rules or an algorithm to fit all languages. On the other hand, syllabification rules are the basic backbone for any task related to text-to-spe...
Automatic Syllabification with Structured SVMs for Letter-to-Phoneme Conversion
We present the first English syllabification system to improve the accuracy of letter-to-phoneme conversion. We propose a novel discriminative approach to automatic syllabification based on structured SVMs. In comparison with a state-of-the-art syllabification system, we reduce the syllabification word error rate for English by 33%. Our approach also performs well on other languages, comparing f...
Publication date: 2005